VAD for VoIP Using Cepstrum

نویسندگان

  • R. Venkatesha Prasad
  • H. S. Jamadagni
  • Abhijeet Sangwan
  • M. C. Chiranth
چکیده

As telephony services are being supported on Internet the focus is now on multiplexing many speech streams by exploiting the speech characteristics. The multiplexing gain is an important factor when applications such as teleconference service are ported on to the Internet. Here we discuss Voice Activity Detection (VAD) for Voice over Internet Protocol (VoIP) based on Cepstrum. VAD aids in saving bandwidth of a voice session. Such a scheme would be implemented in the application layer thus VAD is independent of the lower layers. The standard codecs would inherently have the VAD algorithms to reduce the bandwidth. However they are costly and computationally complex. In this paper, we compare the quality of speech, level of compression and computational complexity of our method of Cepstrum based VAD with the standard GSM and ITU-T G.729 codecs. Bandwidth reduction is achieved by not transmitting the non-speech packets. Our algorithm adapts to the varying background noise.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SVM-based Voice Activity Detection for Distributed Specch Recognition System

Voice Activity Detection (VAD) algorithms based on machine learning techniques have shown competitive results in the area of automatic speech recognition. This paper describes a new approach of VAD based on Support Vector Machines (SVM) for Distributed Speech Recognition (DSR) system. In the proposed scheme, the speech and the non-speech frames are detected from the compressed Mel Frequency Cep...

متن کامل

Phone-duration-dependent long-term dynamic features for a stochastic model-based voice activity detection

Accurate voice activity detection (VAD) is important for robust automatic speech recognition (ASR) systems. This paper proposes noise-robust VAD using long-term temporal information in speech. Long-term temporal information has been an ASR focus recently, but has not been investigated sufficiently for VAD. This paper describes an attempt to incorporate long-term temporal information into a feat...

متن کامل

VAD Techniques for Real-Time Speech Transmission on the Internet

We discuss techniques for Voice Activity Detection (VAD) for Voice over Internet Protocol (VoIP). VAD aids in reducing bandwidth requirement of a voice session thereby using bandwidth efficiently. Such a scheme would be implemented in the application Layer. Thus the VAD is independent of the lower layers in the network stack [3]. In this paper, we compare four time-domain VAD algorithms in term...

متن کامل

Mel-cepstrum-based steganalysis for VoIP steganography

Steganography and steganalysis in VoIP applications are important research topics as speech data is an appropriate cover to hide messages or comprehensive documents. In our paper we introduce a Mel-cepstrum based analysis known from speaker and speech recognition to perform a detection of embedded hidden messages. In particular we combine known and established audio steganalysis features with t...

متن کامل

Comparison of voice activity detection algorithms for VoIP

We discuss techniques for Voice Activity Detection (VAD) for Voice over Internet Protocol (VoIP). VAD aids in saving bandwidth requirement of a voice session thereby increasing the bandwidth efficiently. In this paper, we compare the quality of speech, level of compression and computational complexity for three time-domain and three frequency-domain VAD algorithms. Implementation of time-domain...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003